141 research outputs found

    Less Is More: Robust Robot Learning via Partially Observable Multi-Agent Reinforcement Learning

    Full text link
    In many multi-agent and high-dimensional robotic tasks, the controller can be designed in either a centralized or decentralized way. Correspondingly, it is possible to use either single-agent reinforcement learning (SARL) or multi-agent reinforcement learning (MARL) methods to learn such controllers. However, the relationship between these two paradigms remains under-studied in the literature. This work explores research questions in terms of robustness and performance of SARL and MARL approaches to the same task, in order to gain insight into the most suitable methods. We start by analytically showing the equivalence between these two paradigms under the full-state observation assumption. Then, we identify a broad subclass of \textit{Dec-POMDP} tasks where the agents are weakly or partially interacting. In these tasks, we show that partial observations of each agent are sufficient for near-optimal decision-making. Furthermore, we propose to exploit such partially observable MARL to improve the robustness of robots when joint or agent failures occur. Our experiments on both simulated multi-agent tasks and a real robot task with a mobile manipulator validate the presented insights and the effectiveness of the proposed robust robot learning method via partially observable MARL.Comment: 8 pages, 8 figure

    MSS U-Net: 3D segmentation of kidneys and tumors from CT images with a multi-scale supervised U-Net

    Get PDF
    Accurate segmentation of kidneys and kidney tumors is an essential step for radiomic analysis as well as developing advanced surgical planning techniques. In clinical analysis, the segmentation is currently performed by clinicians from the visual inspection of images gathered through a computed tomography (CT) scan. This process is laborious and its success significantly depends on previous experience. We present a multi-scale supervised 3D U-Net, MSS U-Net to segment kidneys and kidney tumors from CT images. Our architecture combines deep supervision with exponential logarithmic loss to increase the 3D U-Net training efficiency. Furthermore, we introduce a connected-component based post processing method to enhance the performance of the overall process. This architecture shows superior performance compared to state-of-the-art works, with the Dice coefficient of kidney and tumor up to 0.969 and 0.805 respectively. We tested MSS U-Net in the KiTS19 challenge with its corresponding dataset.</p
    • …
    corecore